CDS

Accession Number TCMCG021C29885
gbkey CDS
Protein Id XP_019702878.2
Location join(88905..88999,89499..89625,89796..89833,92239..92332,93351..95644,97171..97302,97470..97542,97851..97890,98155..98227,99291..99356,102643..102720,105407..105579,118979..119025,119170..119413,119498..119793)
Gene LOC105035387
GeneID 105035387
Organism Elaeis guineensis

Protein

Length 1289aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_019847319.2
Definition uncharacterized protein LOC105035387 isoform X2 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category L
Description DNA binding domain with preference for A/T rich regions
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
KEGG_ko ko:K15200        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGATGCTATTTTGGCTCTTTGTGGGGAGCCTGGCGAGCAGCACTTTGAGTCTGGCGAGATTCAGCGGTTCTCTTCCATGATCACCTTTTTGAAAGAGTGGAGGCATTTCTATTATGAACCAAAAATTATCAACTTTTCTTTTGAAACTGACTCTGCACGAGTGAAAGTTGTTCCTGATGGGATACACTTATCTCAGTTTTCATCTGCAGCTGTTCCGAAGATGGGACAACTGCCTCGTAAAACAAGAAATTCAGACAGCAATGATTTTGTCATGCATGCTGGTGGGCATGTCTGGGGATTGGACTGGTGTCCCCAGATCCATGAAAAGCCTCATTCTAATATCAAATGTGAGTATCTTGCAGTTGCTGCTCACCCTCCTGGTTCTACATATCACAAGATAGGTGTCCCATTGATTGGAAGAGGTGCCATTCAAATCTGGTGCCTCTTAACTTTGGATGAGAAAGTGGAATTTTCTCTACCAAGGTCCAAAAGAGGGAGACCTAAAAAGGAACCCGTTAAAGAAGAACCACTAAATGATTTTAATGGTACAGGTATGGCAATAACCGCCAAAAGGTCCAGAGGCAGACCAAGGAAAAGGCCAGTTGAAAATGATACAAAATATGCTTTAAGTGTGGAAGATGGATCAGATCTACCAAGCCCAAGTTGGAGACCTAAAAAGAGACTGATGCTGGGTGTAGTTGATCTAAATGGTTCAAAGAAATTATCTCCAGCAAAGCCTAGAGGGAGACCCAGGAAAAAACCAACTTCTGATAACAACAGTGTACAAAAATCTTTTCTTGCCAAGCCCAGAGGGAGACCTAGAAAACATTCGCCTCCAAGCATTGATAATTCAAATGACAAAGATGTCTCACGTCCTTGCAGTAACAATCAAATTCAGATTGTAAGTGAGTCTAATGTGTGTACAACTGTTAATTCAGGGAATAATGTAATGGCATTGTCCTTTTCTGCTGATGTAAATTGTGGGGAGGTAACAATTCAAAAAAGGTGCAGAGGAAGACCCAGAAAGAATTCCATTTCAACTGTAAATGATCATGTTCCAGAATCTGGGGTTGAATCAGGGAATGGTACATCTTTTTTGGCCACTTCAAGCAGATCTGAGACTTTGGACATGAATGAATCATTTTTATGCAGTAACAATGAGATTCAAAGTGCTGTTGATTTGGGGAATATTGCAGTGGCATCCCCTGTTTCTGCTGATGTAAATTGTGAGGAGGCAACAATTCAAAAGAGGTGCAGAGGAAGACCTAGAAAAAACTCCATTTCAAATGTAAATGAACACATTCCAGCATCTGGTGTTGAATCAGTGCATTGTACATCTTCCTTGGTCACTGCAACCAGACCTGAGACTTTGAACATGAATGAATCATGTTCATGCAGTAACAATCAGATTCCAAGTGCTGTTGATTTGGGGAATATTGCATTGGCATTGCCTGTTTCTGCCGATGTAAACTTTGAGGAGGGAATGATTCAAAAAAGATGCAGAGGAAGACCTAGAAAGAACTCCATTTCAGATGCAAATGAACACATTCCAACATCTGGTGTTGAATCAGGAAATGGTACATCTTCCTTGGCCACTTCAACCAGACCTGAGACTTTGAACATGAAAGGATCATTTTTATGCAGTCACAATCAGATTCTAAGTGCTCTTGATTCGGGGAATACTGCATTGCCATCTCCTGTTTCTGCTGATGTAAATTGTGAGGAGGGATCAATTGAAAGAAGATGCGAAGAAAACATCTCAAGTGTAAATGAATGTGTTCCAGCATCTGCTGTTGTATTAGGGAATGGTACATCTTCCTTAGCCACTCCAAGCAGATCTGAGACTTTGAACATTAATGAATCATGTCTATGCTGTAACAGTCAAATCCGAAGCACTGGTGAGTGTGTTCTGCATTTAACTGTCGAATCGGGGAATGCTGCATCAGCCTTACCTGTTTCTGCTGATGCACACTGTAATGAGGGAACGTGTCCTCCAAGGCGTAGAGGGCGACCTCGAAAGAGGCCACTTCCAACTATAAACAAGTGTGTTATGGCATCCGGTGTTGAATCAGGGAATGATGTATCTGTATTGCCAACTTGTAGCAGACCTGGTATTTCCAGTGTAGACAAATCACCTCTATTTAGTAATAGTCAAACTCTAAATGGAAGTGAGGGTTTTCTTCCTTGTGATCCAGGGAATTTTGGATTGGCATCATCTGATTCTGTTGATGTAAATTGTAAGGTGGATACAATTCAACAAAGGCACAGAGGGAGACATAGAAAGCAGCTAGTTTTAAGCTTGAACAAATGTTTTCTGGAATCTGGAGTTGAATCAGTGGATGATACATTAGCATTGCCCACTTCTAGAGGACCTGAGACATTGGATGTAGTTGAATCACCTCTGTACGGCAATTCTCAGGATGCGATGCTCTTAAGTAATGAAGCGGGCTGTGAGAGCTCATCTAAAGCTGACTTAACTAGTTTAATTCCAAGAGACATTGCTTTGCCCAGGGTTGTACTCTGTCTAGCTCACAATGGGAAAGTTGCATGGGATGTGAAATGGAGACCTTGCACCATCAACGATTCAGAAGGCATGCATCATATGGGTTATCTTGCTGTATTGTTGGGAAATGGTTCTCTGGAAGTGTGGGAAGTCCCAGCCCCTAGCATTGTCAAAGTTTTCTTTGCTTCTAGCTGCAGTGAGGGTACTGATCCTCGTTTTTTGAAATTGGAACCTGTATTCAGATGCTCAAAGGTGAAATGTGGAGATCGACACAGCATTCCTCTGACAATGGAGTGGTCACCTTCTGCCCCGCATGATCTAATATTAGCTGGATGCCATGATGGAACGGTTGCCTTGTGGAAGTTTGCTAAACAATATCCATCTCAAGATACAAAACCTTTACTTTGCGTCACGGCTGATTCTGCTCCTATAAGAGCACTTGCTTGGGCTCCAGAGGAAAGTGATAAGGAGAGTGCAAATCTTTTTGTGACTGCTGGACATGAAGGTTTAAAATTTTGGGACCTGCGTGATCCATACCGTCCGCTATGGGACTTGAATCCCACGCCAAGAGCAATTTTGAGCGTGGATTGGGTAAAACATCCTAGATGTATCGTCTTATCACTTGATGATGGAACCTTGAGGATCCTCAGCTTGTGGAAAGCAGCATATGATGTTCCTGTTACTGGAAGACCGTTTGCTGGAACAAAGTATCAAGGGCTGCATAACTTTGGCTGCTCATCTTTCGCCGTTTGGAGTGCCCAGGTGTCACGAACTCTAGGTCTCGTTGCTTATTGTAGTGCAGATGGATCTACAGTTCGGTTTCAGCTTACTGAAGCTGTGGACAAAGATCCAAAGCGAAACCCTAAACCGCATTTCCTTTGCGGGTCACTTATGGAAAAAGGCCAAGTTCTTGAGATCAATAGTCCGCTACCTGATGTTCCATTGCCCAACATTCCTTTTGTGCAAAAGAAGTCGGTTGATGACTGTGTGGACACTGCTCCGACCATGCAGTTGCATGGTTGCTTGTCAGATGTGGACCAGGCAAAACAAACAGGTCATGCTGTTTCAGGTAGTGAAGAAACAATGGGAAATACAACATCAAAATCCAGAAAGAATGAGAGGAAGAAACAGCATGCAAGTGCTATTGCTGTGCAAACAAAATTTCATGCTGAAATAGAGCAAGGGATATTGCAAAGAAACGAAAACAAAGATGAAGGATCTCCACAACAGTTTGAAGCACACCCTCCCAAAGTTGTGGCTATGCATAGGGTAAGGTGGAACATGAACAGAGGGAGTGAAAGATGGTTGTGTTACGGTGGAGCTGCAGGCATCATTCGATGTCAGCAAGTTTCTTTGCAAATGTAG
Protein:  
MDAILALCGEPGEQHFESGEIQRFSSMITFLKEWRHFYYEPKIINFSFETDSARVKVVPDGIHLSQFSSAAVPKMGQLPRKTRNSDSNDFVMHAGGHVWGLDWCPQIHEKPHSNIKCEYLAVAAHPPGSTYHKIGVPLIGRGAIQIWCLLTLDEKVEFSLPRSKRGRPKKEPVKEEPLNDFNGTGMAITAKRSRGRPRKRPVENDTKYALSVEDGSDLPSPSWRPKKRLMLGVVDLNGSKKLSPAKPRGRPRKKPTSDNNSVQKSFLAKPRGRPRKHSPPSIDNSNDKDVSRPCSNNQIQIVSESNVCTTVNSGNNVMALSFSADVNCGEVTIQKRCRGRPRKNSISTVNDHVPESGVESGNGTSFLATSSRSETLDMNESFLCSNNEIQSAVDLGNIAVASPVSADVNCEEATIQKRCRGRPRKNSISNVNEHIPASGVESVHCTSSLVTATRPETLNMNESCSCSNNQIPSAVDLGNIALALPVSADVNFEEGMIQKRCRGRPRKNSISDANEHIPTSGVESGNGTSSLATSTRPETLNMKGSFLCSHNQILSALDSGNTALPSPVSADVNCEEGSIERRCEENISSVNECVPASAVVLGNGTSSLATPSRSETLNINESCLCCNSQIRSTGECVLHLTVESGNAASALPVSADAHCNEGTCPPRRRGRPRKRPLPTINKCVMASGVESGNDVSVLPTCSRPGISSVDKSPLFSNSQTLNGSEGFLPCDPGNFGLASSDSVDVNCKVDTIQQRHRGRHRKQLVLSLNKCFLESGVESVDDTLALPTSRGPETLDVVESPLYGNSQDAMLLSNEAGCESSSKADLTSLIPRDIALPRVVLCLAHNGKVAWDVKWRPCTINDSEGMHHMGYLAVLLGNGSLEVWEVPAPSIVKVFFASSCSEGTDPRFLKLEPVFRCSKVKCGDRHSIPLTMEWSPSAPHDLILAGCHDGTVALWKFAKQYPSQDTKPLLCVTADSAPIRALAWAPEESDKESANLFVTAGHEGLKFWDLRDPYRPLWDLNPTPRAILSVDWVKHPRCIVLSLDDGTLRILSLWKAAYDVPVTGRPFAGTKYQGLHNFGCSSFAVWSAQVSRTLGLVAYCSADGSTVRFQLTEAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPDVPLPNIPFVQKKSVDDCVDTAPTMQLHGCLSDVDQAKQTGHAVSGSEETMGNTTSKSRKNERKKQHASAIAVQTKFHAEIEQGILQRNENKDEGSPQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLQM